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Abstract 

We compare several instances of pure-state Belavkin weighted square-root measurements from 
the standpoint of minimum-error discrimination of quantum states. The quadratically weighted 
measurement is proven superior to the so-called "pretty good measurement" (PGM) in a num- 
ber of respects: 

1. Holevo's quadratic weighting unconditionally outperforms the PGM in the case of two- 
state ensembles, with equality only in trivial cases. 

2. A converse of a theorem of Holevo is proven, showing that a weighted measurement is 
asymptotically-optimal only if it is quadratically weighted. 

Counter-examples for three states are constructed. The cube- weighted measurement of Ballester, 
Wehner, and Winter is also considered. Sufficient optimality conditions for various weights are 
compared. 
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1 Introduction 



1.1 Optimal measurements 

Consider an ensemble £™ lxcd of mixed quantum states p k with a priori probabilities pk, k — 1, .., m, 
and unit normalizations Tr p k = 1 . Of fundamental importance is 

The minimum-error quantum distinguishability problem: If an unknown state 
Pk is blindly drawn from the ensemble, what is the chance that the corresponding value of 
fc may be correctly identified by performing an optimally chosen quantum measurement? 

The modern approach to this problem is to consider measurements defined by 

Definition 1 A positive-operator valued measure (POVM) {Mk} (see, for example, p. 74 
of 0/ ) is a collection of positive semidefinite operators on a Hilbert space Ji such that ^2 Mk = & ■ 
The probability that the value i is detected when the POVM is applied to the state pj is given by 
Pi\,j = Tr Mipj. In particular, the success rate for the POVM to distinguish the ensemble £™ zxed 
is given by 

m 

Psucc — ^^Vk Tr (pkMk) ■ (1) 

k=l 

Minimum-error quantum measurement was first considered in the 1960s in the design of high 
performance optical detectors More recently, this problem has been fundamentally important 
in quantum Shannon theory (for example 043) ano - m construction of quantum algorithms for the 
Hidden Subgroup Problem. [ a-llCI . Various necessary and sufficient conditions for optimal measure- 
ments have been derived [Ill4l6l | (see also [l7j), A number of relatively recent works give interesting 
general upper and/or lower bounds on the quantum distinguishability problem, pi Il8l42{| Explic- 
itly solving the general optimal measurement problem is most likely impossible, but in specific 
numerical cases one may compute the optimal measurement by numerical iteration [2EI- 28| or by 
numerical solution of the associated semidefinite program [171 ] . 

The optimal measurement problem has been generalized to wave discriminati on 12911 and to op- 
timal reversals of quantum channels, in the sense of average entanglement fidelity [19i. I30h34| . More 
recently, the success-rate of optimal measurements has been expressed in terms of the conditional 
min-entropy of corresponding classical-quantum states. (See Theorem l of [35j|.) 

1.2 Belavkin's Theorems 

In the rest of this paper we shall restrict consideration to the ensemble 

£m = {tyk,Pk)}t-l,...,m ( 2 ) 
of pure quantum states ipk € TL 1 and consider POVMs given by 

Definition 2 The Belavkin Weighted Square Root Measurement (BWSRM) fl2. \3b\] (also 
known as a Weighted Least-Squares Measurement \3^] ) with weights Wk > is the 

M k = W t \<pt) (H^j W k iVfc) {M (j2 Wt \i/> t ) (i>e\\ . (3) 

on the linear span of the Wk \tpk)- 

The importance of BWSRMs in minimum-error discrimination problems was shown by the 
following 



lr The negative fractional power is well-defined on the restriction to the span of the W^ipk- More properly, one 
may define A -1 / 2 = ^ X k 1//2 \(j>k) (0feli where A = y ^A^ \4>k) {<t>k\ ls a spectral-decomposition with Aj, = terms 
omitted. 
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Theorem 3 (Belavkin 1975 [3, [13]) A POVM {M k } on Span(£ m ) = Span {{p k ip k }) is opti- 
mal if and only if it may be expressed as a BWSRM with weights W k such that the operator 

I m \ 1 / 2 

A=[J2Wi\^)(M) (4) 



is invertible on Span(£ TO ) and 
with equality when W k >O0 



P* <^fc| A" 1 |Vfc> < 1, (5) 



Note that a simple formula for a set of optimal weights corresponding to a given optimal mea- 
surement 

W^ = ^ k \M^\^ k )xpl (6) 

follows by squaring both sides of ((5|) and multiplying by W k , whether or not W k = 0. Furthermore, 
Belavkin's theorem implies that optimal measurements on Span (£ m ) satisfy Rank (M^ pt ) < lH 

Belavkin and Maslov generalized Theorem[3]to mixed states (and more generally to wave pattern 
recognition) in section 2.2 of [29(. Iteration of a mixed state version of equation (|6|) was explored 
in [27L |28| as a method for numerical computation of optimal measurements. 

Many of the known exactly solvable optimal pure state measurements are special cases of 



Theorem 4 (Belavkin 1975 [14, 36], also Ban (4ll ] ) The measurement © for the weights W k 



p k is optimal for the pure state ensemble £ m if the chance of successfully identifying a given state 
ip k is inversely proportional to its a priori probability^ 

p k (ip k \ M k \ip k ) = const. (7) 

Note that condition ([7]) is sufficient but not necessary as may be seen by considering direct sums. 
Belavkin originally applied Theorem 2] to homogeneous systems, cyclic systems, and systems of 
coherent states. 36] (A generalization of cyclic systems appears in [42|). 

1.3 Sub-optimal measurements 

In abstract studies of quantum channel capacities or quantum algorithms, numerical routines for 
solving specific instances of optimal measurement problem are often neither feasible nor desirable: 
one often has to rely on sub-optimal measurements. Several extant approximately optimal mea- 
surements are examples of 

Definition 5 Forr > 0, the Belavkin power-weighted square-root measurement (BWSRM- 

r) is the BWSRM with weights W k =p r k . 

Examples of BWSRM-r's appearing in the literature correspond to r = 1, 2, 3. Note that in the 
case of equiprobable (p k — 1/m) pure states that all BWSRM-r's are identical, and are of pervasive 
utility in quantum information theory. (See, for example Q.jl 



2 Mochon rediscovered that every optimal pure-state measurement may be expressed as a BWSRM. [3811 

3 For mixed states, Rank ^M^ pt ^ < Rank(p^) [13 . See also equation 5 of [27ll and \vh . Equation \E\ may be 

understood geometrically using "frame forces," which have been advocated by Kebo and Benedetto |39tl40l|. 

4 The weighted measurement defined by W k = Pk sometimes appears as M k = |efc)(ej.|, with |ej.) = 
5Zfc=i IV^) (-P - 1 ^ 2 )ik> wnere P i s t ne Graham matrix Py = yJPiPj {ipi, i/>j). Condition JTJl is then written as 
Vr = (\/p) for all The equivalence of these two formulations follows from the matrix identities 

\ / a V / jj 

rt (rrt)~ 1/2 = (rtr)" 1/2 rt andr(rtr)" 1/2 rt = (rrt) 1/2 = VP, where r = E*Liv^Wc> Wfclw : n -> cm - 

Here {K)j;7n} is the standard orthonormal basis of <C m . 

5 The study of BWSRM-r's as approximately-optimal measurements in the equiprobable case goes as far back as 
and |H. 
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We have already encountered the r — 1 case in Theorem 01 This measurement came to be 
known as the "pretty good measurement," (PGM) because of its reintroduction two decades later 



by Hausladen and Wootters as an ad hoc approximately optimal measurement 4J, |45[ with simple 



error bounds. Barnum and Knill showed that the failure rate of the mixed-state version 

< GM = (E««)~ 1/S W* (E^r /2 

of the Belavkin-Hausladen- Wootters PGM satisfies the bound 

< ^i? M < i£S? (1 + PsZ\) < (8) 
where P f °?j is the minimum-error failure rate. .19, 20] The bound 

i^ M <$>K^,Vi)l a (9) 

was proved by Hayden ef [18| , generalizing the equiprobable bound of 

The cube-weighted BWSRM-3 was employed by Ballester, Wehner, and Winter in the study of 
state discrimination with post-measurement information. 46|, 47 1 



1.4 Asymptotically-optimal measurements & BWSRM-2 

The quadratically weighted BWSRM-2 will be of particular interest in the present work, and its 
mixed state generalization will be studied in the sequel. Relatively recently, the mixed state version 
of BWSRM-2 has appeared as the first iteration in a sequence of closed form measurements which 
appear to converge to the optimal measurement . (27l. [28| . This weighting was first specifically 
considered by Holevo, who was most interested in the case of nearly orthogonal ip k - 

Definition 6 A me asu rement procedure G for distinguishing the pure-state ensemble £ m is asymp- 
totically optimal \&&] if for fixed pi, ...,p m one has 

Pfail (£m) 

as the states i\) k approach an orthonormal basis^\ 
Holevo showed that 

Theorem 7 (Holevo's asymptotic-optimality Theorem (1977) [48]) The quadratically-weighted 
pure state Belavkin measurement BWSRM-2 is asymptotically optimal. 



As we will see in section [2721 this property is not shared by the "pretty good measurement." 
The key idea in Holevo's proof was the construction of BWSRM-2 using an approximate minimal 
principle: 



Theorem 8 (Holevo 1977 [48].) Assume that the states ip k are linearly independent. Then the 
von Neumann measurement M k = \e k ) (e k \ minimizing 

m 

C Holevo ({e k }) = J^p k U k -e k \\ 2 (10) 
fe=i 

over orthonorma^ sets {e k } is the quadratically weighted Belavkin measurement BWSRM-2. 



6 Equation ([9} follows by summing the conditional error bound (A6) of [18l |, Bounds based on the pairwise 
quantities \{ipi, tpj)\ 2 are inherently limited [Soli , although frequently useful. 

7 It is presumably intractable to produce a closed-form measurement process G for which 
^fail (^ m ) /^ftjl™ (^™) — > 1 as the i/ij, and pj, are arbitrarily varied in such a way that p^P tlma (£m) —* 0. 
Otherwise, one could recover the optimal measurement for a fixed ensemble S m on H by taking the A — » 1 — limit of 
the ensemble £' m+1 = {( 4'k i (1 — ^) Pk)} U {(0, A)} on a dilation H'dH, with rf>A/H. 

8 Orthogonal measurements are optimal for distinguishing linearly-independent pure states. 1, 36, 38, 49] 
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Theorem[3]was generalized by Eldar and Forney 37f , who showed that the BWSRM with weights 
Wk minimizes C^ Wk ^ = ^2 Wk \\ipk — e fe|| over POVMs Mk — |e&) (e^ | without any assumption of 
linear independence 

1.5 Results 



In section 12.31 it is shown that a weighted measurement is asymptotically optimal only if it is 
quadratically weighted, proving a converse to Holevo's asymptotic optimality theorem. In section 
2.21 the PGM is found to be categorically worse than the quadratically weighted measurement for 
two pure states. In section 12.41 we make a heuristic comparison between various weightings, and 
present a counter-example to show that the relationship between weightings is more complicated 
for ensembles of more than two states. Finally, in section 12.51 we compare sufficient optimality 
conditions for various weightings. 



2 Pure State weighted measurements 
2.1 Continuity 

Although weighted measurements are defined using the singular map x i— > x -1 / 2 , one still has 

Theorem 9 For fixed weights Wk, k — l,...,m, the success rate of the weighted measurement for 
distinguishing the pure state ensemble £ m = {(ipk>Pk)}k=i m * s a jointly continuous function of 
the tpk an d Pk ■ 

Proof. Define the operator A : C m — > TL by 

m 

A = Y,Vw~ k \4>k) H (fc| cra , (11) 
fc=l 

where {|fc) Cm } is the standard orthonormal basis of C m . Then 



pW- weighted _ 

1 succ / / Pk 

k=l 



"1/2 



i=i 



E|| (fe| cm At(AAt)- 1/2 A|fc) t 



fc=l 



m 

=Ei(w(^) +1/2 w 

Continuity of Pf a n follows from the continuity of the square rootp 



(12) 



9 One can recover this generalization from Holevo's argument using Naimark's Theorem. |39( Note that the cost 
function C^° lcv ° for arbitrary p k already appears as eq. 8 of [48l . 

10 By the Weierstrauss approximation theorem 15011 . given e > one can find a polynomial P such that 

|p(A) — -\/a| < e for all A in the interval / = [0, J~] \ . Since / contains the spectrum of At A for any choice 

of {V"fc}i continuity of (1 1 2 I t is guaranteed by Theorem 7.12 of |50l . 
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2.2 Explicit comparison of weighted Belavkin measurements for 2 pure 
states 

We first consider binary ensembles: 

Theorem 10 The failure rates for distinguishing the binary ensemble £ 2 using optimal and weighted 
measurements are given by 



D o^mal = 1 _ (13 ) 



pweighted _ (PlW 2 + Pi W\ ) COS 2 , ( { , 

faU W x + W 2 + 2VW1W2 I sin 6\ ' 



where cos 6 = \ {tj)x,tp2}\- 



Proof. Equation (fill]) is equation 2.34 on page 113 of [lj. 

For an arbitrary 2x2 positive matrix B it is easy to use the spectral theorem to verify that 
B +i/2 _ ( 2v ^MB + TtB] (b + VdctB x 1 



For A defined by (|ll|l one has 

detA f A = VFiW^sin 2 ^ 
TrA^A = Wi + W 2 . 

Equation (fT4| now follows from ([12]) : 

,+1/2, 



= Efp" ( 2 VWW^ |sin0| + Wi + W 2 ) (w k + \/WiW 2 |sinf| 



= 1 - 



-1/2 

I + H/i + 14/ 2 J 

^PfeVFi-fcCos 2 ! 



Wi + W 2 + 2^JW X W 2 |sin ( 



Theorem 11 (Holevo's measurement is better than the PGM for two pure states) For dis- 
tinguishing the 2-pure-state ensemble £ 2 one has the following inegualities 

pPGM ^ rjHolevo /1 r\ 

*f a iX £- -f/oii ) V 10 / 

pPGM pHolevo rx , 

2 = sup —r- ; > sup .. j = — - — « 1.207 (16) 

pcubic weighting 

> sup fail .. , « 1.118, (17) 

with equality in (|_?5[) iff Pi,p 2 € {0,1/2,1} or ('01, V^) = 0. 

Proof. To prove (fT5|) . note that since ^Jp\p 2 < 5, we have the inequalities 

1 , 

2 + vW2 < 1 

Vpl^(|sin0| - 1) < 2p lP2 (|sin0| - 1) . 



G 



Figure 1: Pfau/ifaj* for the PGM (upper) and Holevo's 
measurement (lower) for binary ensembles 
with | (^i, ■02)1 = cos 9 and p\ = 1 — p2 = p 



Summing gives 

^ (l + 2 1 /pl^|sin0|) < 1 + 2pip 2 (|sin6»| -1) = p\ + p\ + 2 Pl p 2 sin |0| . 

Equation (fT5| follows by dividing pip 2 cos 2 9 by both sides and applying (Tl4f . 

The equation on the left-hand side of (fTBj) shows that the bound ([8|) of Barnum and Knill is 
sharp. To see that 

p f p G M 

poptimal — ^' 
£a Mail 

take p\ — ► + for any fixed (ipi,ip2) ^ 0. The equation on the RHS of (TlB|) is an unilluminating 
exercise in calculus. The maximizing ensemble is given by V'l — ^2 and pi = \/2/2. The last 
inequality (|17p was computed numerically. ■ 

The relative success rates of Pfaii /-Pfai* lmal f° r the weightings r — 1,2, and 3 of measurements 
on the ensemble £2 with | (^i, 1^2) \ = cos9 and p\ = 1 — P2 = p are plotted in Figures 1 and 2a/b. 
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2.3 Asymptotic optimality 

Holevo's quadratic weighting is uniquely characterized by the following converse of Theorem [71 

Theorem 12 (Converse to Holevo's asymptotic optimality Theorem) Fix probabilities pk > 
and weights Wk > 0, k = 1, ...,m. Then the Belavkin weighted square root measurement (J3J) is 
asymptotically optimal for distinguishing ensembles £ m — {(ipk,Pk)}k=i m on ^H tfWk =constxp^. 

Proof. Setting Cfc = Wk/pi, we must show that Cfc = Ck> for all fc, k' under the assumption that 
{Wfc} defines an asymptotically optimal measurement. It is sufficient to consider the case m — 2p^l 
By Theorem [TOl and L'Hospital's rule 

P* d Pl W 2+P2 W 1 cos 2 c? 



lim ta 'V , = — - — J- x lim 

^optimal " - - " " • 



(c 2 p 2 + cipi)pip 2 1 



/WT + VW) 2 PlP2 

C\P\ + c 2 p 2 



The conclusion follows from the strict convexity of x i— > a; 2 . ■ 

2.4 Reflections & Counter-examples for three states 

We now reflect on the relationships between Belavkin's optimal weighting (Wk — p\ (V>fc| M^ pt IV'fe))? 
the weighting for the PGM (Wk — Pk), Holevo's weighting (Wk — p%), and the weighting of 
Ballester, Wehner, and Winter (Wk — Pk)- Note that while Holevo's measurement relatively over- 
weights vectors ^fe for which (ipk\ M^ pt \ipk) is relatively small, the PGM additionally over- weights 
vectors for which pk is small! In general, one therefore expects that the relative misweightings of 
the PGM tend to compound one another, so that BWSRM-2 is better that BWSRM-1. Similarly, 
by approximate cancellation of misweightings, one expects that the cubic weighting will sometimes 
outperform the quadratic weighting for ensembles far from the asymptotically orthogonal regime 
considered by Holevo. 

We have seen that Holevo's measurement is always as least as good as the PGM for two-state 
ensembles. For three states the above intuitive argument does not always hold true, as shown by 
the following pathology: 

Theorem 13 There exists a 3-state ensemble with the properties that: 

1. There is an optimal measurement such that the a priori strictly-most-probable state is NEVER 
detected. 

2. Holevo's measurement is worse than the PGM: Pf a °{ em > Pf a fi M ■ 

Proof. Define the 3-state ensemble by ipi = (cos 8, sin0), ip 2 = (cos 8, — sinfl), and ^3 = (1,0), 
where 8 = 7r/6 and 

Pi =P2 = (1-Pa) /2 = (2 + (cos A + sin (9) cos (9) _1 » .3142 < 1/3. 
It is straightforward to check that the POVM 

Mi = \ : ; m 2 i ■ , .,' i/, = (i 



1 


1 




' 1 -1" 


M 2 = \ 


1 


1 




-1 1 



11 The case m > 2 is reduced to m = 2 by considering ensembles for which each of a subset m— 2 states is orthogonal 
to all of the other states in £ m . 



8 



satisfies the necessary and sufficient optimality conditions [13j 

(L + it) /2 - p k \ip k ) (V» fc | > for all fe, 
where the Lagrange operator L is given by 

3 



L = Y,Pk M k \i> k ) (t/> k \ = Pl (cos (9) + sin (9)) 



cos 9 

sin( 



fc=i 

(Here A > means A is positive semidefinite.) Property [5] follows by direct computation: 

pHolevo _ 4245 > pPGM _ 4224 > p f °P«*»al _ 

■ 

Remark: The linear-dependence of the states in the above construction was not essential: one 
can simply embed the above example in 3-space, and perturb the vectors ip k slightly to make them 
linearly independent. By Theorem [9l property [2] will be unaffected by small perturbations. 

Theorem [13] aside, given any fixed set of non-equal priors p k , we conjecture that Holevo's 
weighting will have a better success rate than the PGM on average for randomly chosen ensembles 
£ m , with the corresponding {tf) k } independently chosen according to Haar measure. 

2.5 Sufficient optimality conditions for weighted measurements 

We close our comparisons of Belavkin weighted measurements by noting that in the case W k > 0, 
Theorem U generalizes easily: 

Theorem 14 (Optimality conditions for positively-weighted measurements) A sufficient 
condition for optimality of the BWSRM with strictly positive weights W k > is that there exists a 
constant c > such that 

p\ (Vfc| M k = cW k for all k. (18) 



Proof. Dividing both sides of (fTg|) by cW k and taking the square root gives 

c- 1/2 Pk (M (J2 W ? IV*) = !• 



In particular, the rescaled weights c x Wi satisfy Belavkin's optimality condition ([5]). The result 
follows, since BWSRMs are unaffected by such rescalings. ■ 

The assumption that W k > for all k is necessary: otherwise the weights W k — 6 k i would be 
optimal for any ensemble. Note that for the asymptotically optimal weight W k = p\, equation (jTHjl 
becomes the particularly simple condition 



(ip k \M k \ip k ) = const. (19) 



3 Future directions 



In the sequel, we focus on the quadratically weighted mixed state measurement, and consider 
resulting two-sided bounds for the distinguishability arbitrary ensembles of mixed quantum states 



51 1 . We will generalize to the case of approximate reversals of quantum channels at a later date 
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